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RAW SEQUENCE LISTING 
ERROR REPORT 



BIO JHNOLOGY ^ <fH.> 
SYSTEMS- 
BRANCH 




Tlie Biotcclinolog>' Systems Branch of the Scicntinc and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 0-^ 



Source: 



Date Processed by STIC: r)i) Jj-l^o\ 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mail help: patin21hcln(g)usnto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 c-mail help: patin3lielD@usnto.gov or phone 703-306-4 11 9 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U S PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3 0 application is a statc-of thc-art Windows based software program 
employing a logical and intuitive uscr-intcrfacc to check whether a sequence listing is in 
compliance with format and content mlcs Checker Version 3 0 works for sequence listings 
generated for the original version of 37 CFR §§ 1 82 1 - 1 825 efTcctivc October 1, 1990 (old 
rules) and the revised version (new rules) effective July I, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST 25 

Checker Version 3 0 replaces the previous DOS-based version of Checker, and is Y2K- 
comphant Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errorcd sequence 
listings, thus saving time and money 



Checker Version 3.0 can be down londcd from the USPTO website at the following address: 
littp://w\v\v.uspto.gov/wcb/ofriccs/|)ac/chcckcr 



• 



ERROR DETECTED 

ATTN: NEW RULES CASES; 

1 Wrapped Nuclcici 

Wrapped Aminos 



Invilld Lirvc Length 

MUali^cd Amino 
Numbering 

Non-ASCII 



Viriiblc Length 



6 Patcnlln 2.0 



Skipped Sequences 



(OLD RULES) 



8 Skipped Sequences 

(NEW RULES) 



9 L^ Use of n'l of Xui'i 
(NEW RULES) 

10 Invalid <213> 

Response 

11 Use or<220> 



12 Palentln 2.0 

"bug" 



■ ' i 

Raw Sequence LisClnc Error Summtry ''.^ - 

e.,r-r.FCTFnrORRECTION SERIAL NUMBER: :Z ^ 

PLEASE DISREGARD ENGLISH "ALPHA" HEADERS. WHICH WERE INSERTED BY PTO SOR 

■n.« numb«Acxt .t the end ofe.ch line -wrapped" down 10 the nexl line. Jlib nuy ocoir if your Hie 
w., retrieved in . word procewoc .fUr creating it Please adjurt yoor ngh. margin to .3; Ih., w.ll 
prevent •\vr»pping," - 

•n,e rules require that a line not eicted 72 character, in length, thi, include, white .paces. 

The numbering under each 5» amino acid is miuligned. Do not use Ub code, between number,; 
uie iptcc ch«r»cten, instead, 

■n.e .ubmitled f.le was not saved in ASCII(DOS) text, as required by the Sequence RuUs. Pleue 
eniure your tuliKfutnt tubmlajlon b .and In ASOl Uit. 

Sequence<.) conUin n'. or Xaa". repre«nting mire thai, one residue. Per Sequence Rule,, 
each n or X^TTan only reprt,ent . alnele re.ldue. Please present the maximum number ofeach 
residue having variable length and indicate in the <220>-<223> section that some may be m-ssmg. 

A-bug" in Patenlln version 2.0 ha, 'ciufed ftfe <220>.c22J>«ction to be missing from amino add . ' 
.,„,.JL^^.\ Normally, Patentin would-automaticaIfy.gei>er«Ie this section from the 

^JZvLl^i^^^c^^ «q-nce. Please manually copy th. relevant <"0>-<223> section .0 
L"ub,equent amino acid .equence. Thb .ppUe. to .he mandatory <220>.<2I3> .cctloru for 
ArtincUI or Unknomi Mquence*. 

Sequence<«) missing If intentional, please insert the following Unes for t.eh skipped sequence: 

(2)rNFORM7fiONFORSEQIDNO:X:(insertSEQIDNOwhere*'Y"/^^^^ ... ... 

(\) SEQUENCE CHARACTERISTICS: (Do n.ot insert any tubhcad.ngs under ihis heading) 

{J) SEQUENCE DESCRJPTIONiSEQ ID NO:X: (insert SEQ ID NO where "X" .s shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response lo Include the skipped sequences. 

missing. If Intentional, please insert the following lines for each skipped sequence 



Sequencers) 

<2 1 0> fequcncc id number 
<400> sequence id number 
000 

Use of n's and/or Xaa'i have been detected in the Sequence Listing. 

Per 1 823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or X.a's are present 

In <220> 10 <223> section, please explain location of n or X.a. and which residue n or X.a represents 

Per 1 823 of Sequence Rules, the only valid <213> responses are: Unkiiown^ Artificial Sequer.ce or 
scTenlinc nan.e(Gcnus/spocies). <220>.<223> section is required when <213> response .s Unknown or 
is Artificial Sequence 

Sequencers) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of<220>lo <223> is MANDATORY if <213> "Organism" response is " Artificial Sequence or 



"Unknown" Please explain source ofgenetic material in <220> to <223> section. 
(SeT^FeTe;alRegrster;- 06/01/1998. Vol. 63. No. 104. pp. 29631-32) (Sec. 1 .823 of Sequence Rule 

Please do not use "Copy lo Disk" function of Patentin version 2.0. This causes a comjpled file, 
resulting in missmg mandatory numeric identifiers and responses (as indicated on raw -q-nce 
Usting) bistead, please use "File Manager" or any other manual means to copy file lo floppy d.sk. 



0 



13 * Misuse of n 



n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 
any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Dnnch - 08>7 1/200 1 



The typo of errors ahown exist throughout 

the Sequence Listing, Please check subsequent 

sequences for similar en-'^rs 

■ ■ - ' . 
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OIPE 



RAW SEQUENCE LISTING DATE: 08/08/2001 

PATENT APPLICATION: US/09/8 5 5 , 294A TIME: 10:48:05 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\08082001\I855294A.raw 

4 <110> APPLICANT: Croce, Carlo 

5 Brenner, Charles 

6 Pekarski , Yuri 

10 <120> TITLE OF INVENTION: CRYSTAL STRUCTURE OF WORM NitFhit 

11 REVEALS THAT A Nit TETRAMER BINDS TWO Fhit DIMERS 
14 <130> FILE REFERENCE: CRO01.NP007 

16 <140> CURRENT APPLICATION NUMBER: 09/855, 294A 

17 <141> CURRENT FILING DATE: 2001-05-15 

19 <150> PRIOR APPLICATION NUMBER: 60/204,713 

20 <151> PRIOR FILING DATE: 2000-05-16 ^ - 
22 <160> NUMBER OF SEQ ID NOS : 11 

24 <170> SOFTWARE: FastSEQ for Windows Version 4.0 



ERRORED SEQUENCES 



346 
347 
348 
349 
351 
352 
353 
354 
355 
356 
357 
358 
359 
360 
361 
362 
363 
364 
365 
366 
367 
368 
369 
370 
371 
372 
E--> 373 
374 
405 
406 
407 



<210> SEQ ID NO: 8 
<211> LENGTH: 1359 
<212> TYPE: DNA 
<213> ORGANISM: Homo 
<400> SEQUENCE: 8 



sapien 



gtatatccag gagcttttaa 



atctttgggt gttttgccag agaagttggg cagccccac 
<210> SEQ ID NO: 10 
<211> LENGTH: 1214 
<212> TYPE: DNA 



gcttcagatt 


60 


ggcagcaacg 


120 


agcgaaatat 


180 


tgaagtagca 


240 


tgggaaatta 


300 


tagaaagatc 


360 


aacattgagt 


420 


gggcatctgc 


480 


ccagctgttg 


540 


acttcagcga 


600 


ggatgacaaa 


660 


ggttctagcc 


720 


gctggctgaa 


780 


tgtggagatg 


840 


gattctacaa 


900 


aaaaaaaaaa 


960 


cacgccacat 


1020 


tggttctgaa 


1080 


cttggtatcc 


1140 


aaagttgctg 


1200 


tttccaagca 


1260 


agggtgatac 


1320 




1359 



The type of errors shown exist throughout 

the sSuence Listing ^'ease check subsequent 

sequences for simiis; irr-rc 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/855 , 294A 



DATE: 08/08/2001 
TIME: 10:48:05 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\08082001\I855294A.raw 



408 <213> ORGANISM: X. laevis 

410 <400> SEQUENCE: 10 

411 gtgagtgacg tgtgcgcagt ggcaactaag gcctcctggg aaaatgtaga ggagcgtgag 60 

412 cttcgcggga caggacaggg tcttaggctc tgccttgtgt ccacacgccc ttgtgcagac 120 

413 tgctatagac tgtgacttta accctgtgtc cggatatagg ggttagaagc ctgagtgcaa 180 

414 tggctggtgc ccacaagccc ctgattgccg tgtgccagat gacttcaacc tctgataagg 240 

415 agaagaattt cgccacgtgt tcgcggctga tccgggaggc tgcggggcgt cgcgcttgca 300 

416 tggtgtttct gccggaagcc tttgactata tcgggggcag cattgaggag acgctgagtc 360 

417 tggctgagtc tctacatggg gacaccattc agcgttacac ccaactcgcc agggagtgtg 420 

418 ggctctggct ttccctgggg ggatttcatg agaaaggacc caactgggac acggaccaac 480 

419 gcatttccaa ttctcacgtg gttgtggaca acacagggca catagtatcg gtgtaccgca 540 

420 aggctcacct gtttgacgta gacttgcaga atggagtgtc actcagagag agcagttcca 600 

421 ccctccccgg agcagagctt attcgcccca tcacttctcc agcaggaaag attggcctgg 660 

422 gggtgtgtta cgacctccgc ttcccagaat tctccttggc tctggcccaa caaggagcag 720 

423 aacttctcac ttacccttct gccttcaccc tcactactgg tctggcacat tgggaggtgt 780 

424 tgctgagagc ccgtgccata gaaacccagt gctacgtagt tgcagcggca cagacagaca 840 

425 gacacaatga gaagaggacg tcctatggtc acgctatggt ggtagacccg tgggggctgg 900 

426 tcattggcca atgccaggaa ggaacaggaa tatgttatgc tgagattgac attccctaca 960 

427 tggagcgtgt gaggcgggac atgccggtgt ggaggcaccg caggactgat ctgtatggga 1020 

428 aaatctcctt taataaaccc gactgactcc ataatggatc acctgcacct atgggggcaa 1080 
E--> 4 29 agnctttccc ctgattgctg aaattcctca atctgtgact gtgaatgaca atgaacgtga 114 0 

430 cttggaattg cctggttatg gcaccggcaa tgattctcta cagtaattct caataaaagt 1200 

431 gctgaacctt aaaa 1214 
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VERIFICATION SUMMARY DATE: 08/08/2001 

PATENT APPLICATION: US/09/8 5 5 , 294A TIME: 10:48:06 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\08082001\I855294A.raw 

L:373 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID# : 8 
L:429 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID# : 10 
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STATISTICS SUMMARY DATE: 08/08/2001 

PATENT APPLICATION: US/09/8 55 , 294A TIME: 10:48:06 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\08082001\I855294A.raw 

Application Serial Number: US/0 9/855 , 294A 

Alpha or Numeric: Numeric 

Application Class: 

Application File Date: 05-15-2001 

Art Unit: OIPE 

Software Application: FastSeq 
Total Number of Sequences: 11 
Total Nucleotides: 3865 
Total Amino Acids: 2382 
Number of Errors : 2 
Number of Warnings : 0 
Number of Corrections: 0 

MESSAGE SUMMARY 

340 W: 2 ((46) "n" or "Xaa" used: Feature required) 
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